Comprehensive Minimal Dependency Approach to Lean Annotation of Morphosyntactic Phenomena
نویسنده
چکیده
Viewing linguistic modelling as annotation of a variety of phenomena becomes an increasingly popular perspective in natural language processing and grammar engineering, and it changes the way grammar modularity is understood nowadays. In this contribution we shall concentrate on a generalisation of the notion of dependency in combination with a rich typology of observable relationships holding between linguistically relevant items. The result is a comprehensive minimal dependency approach to lean annotation of morphosyntactic phenomena. Although the pre-theoretical ontology of relational types has been developed on the basis of Slavic morphosyntax, it provides a meta-annotation scheme which — by design — is meant to be compatible with theory-specific annotation schemes. Due to a phenomenadriven setting, the lean annotation approach represents one plausible strategy of introducing systematicity into the interpretation of linguistic data while remaining non-committal in theoretical disputes.
منابع مشابه
An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
متن کاملOntology-Based Lexicon of Bulgarian
In contrast to morphological and syntactic processing semantic annota tion based on domain ontology is still underdeveloped for Bulgarian. On the other hand, the prerequisites for an ontological annotation are already available. These are as follows: a morphosyntactic tagger for Bulgarian with more than 95% accuracy; a dependency parser with more than 84% accura cy; a general chunker and a na...
متن کاملExploring Morphosyntactic Annotation over a Spanish Corpus for Dependency Parsing
It has been observed that the inclusion of morphosyntactic information in dependency treebanks is crucial to obtain high results in dependency parsing for some languages. In this paper we explore in depth to what extent it is useful to include morphological features, and the impact of diverse morphosyntactic annotations on statistical dependency parsing of Spanish. For this, we give a detailed ...
متن کاملDown-stream effects of tree-to-dependency conversions
Dependency analysis relies on morphosyntactic evidence, as well as semantic evidence. In some cases, however, morphosyntactic evidence seems to be in conflict with semantic evidence. For this reason dependency grammar theories, annotation guidelines and tree-to-dependency conversion schemes often differ in how they analyze various syntactic constructions. Most experiments for which constituent-...
متن کاملRepresentation of Morphosyntactic Units and Coordination Structures in the Turkish Dependency Treebank
This paper presents our preliminary conclusions as part of an ongoing effort to construct a new dependency representation framework for Turkish. We aim for this new framework to accommodate the highly agglutinative morphology of Turkish as well as to allow the annotation of unedited web data, and shape our decisions around these considerations. In this paper, we firstly describe a novel syntact...
متن کامل